Spice it up? Mining Refinements to Online Instructions from User Generated Content
نویسندگان
چکیده
There are a growing number of popular web sites where users submit and review instructions for completing tasks as varied as building a table and baking a pie. In addition to providing their subjective evaluation, reviewers often provide actionable refinements. These refinements clarify, correct, improve, or provide alternatives to the original instructions. However, identifying and reading all relevant reviews is a daunting task for a user. In this paper, we propose a generative model that jointly identifies user-proposed refinements in instruction reviews at multiple granularities, and aligns them to the appropriate steps in the original instructions. Labeled data is not readily available for these tasks, so we focus on the unsupervised setting. In experiments in the recipe domain, our model provides 90.1% F1 for predicting refinements at the review level, and 77.0% F1 for predicting refinement segments within reviews.
منابع مشابه
Knowledge Discovery from Online Communities
During recent years, the outstanding growth of social network communities has caught the attention of the research community. A huge amount of user-generated content is shared among community users and gives researchers the unique opportunity to thoroughly investigate social community behavior. Many studies have been focused on both developing models to investigate user and collective behavior ...
متن کاملEffective web log mining and online navigational pattern prediction
The web has become the world's largest repository of knowledge. Web usage mining is the process of discovering knowledge from the interactions generated by the user in the form of access logs, cookies, and user sessions data. Web Mining consists of three different categories, namely Web Content Mining, Web Structure Mining, and Web Usage Mining (is the process of discovering knowledge from the ...
متن کاملMine Your Own Business: Market-Structure Surveillance Through Text Mining
W 2.0 provides gathering places for Internet users in blogs, forums, and chat rooms. These gathering places leave footprints in the form of colossal amounts of data regarding consumers’ thoughts, beliefs, experiences, and even interactions. In this paper, we propose an approach for firms to explore online user-generated content and “listen” to what customers write about their and their competit...
متن کاملAnalyzing user-generated online content for drug discovery: development and use of MedCrawler
Motivation Ethnopharmacology, or the scientific validation of traditional medicine, is a respected starting point in drug discovery. Home remedies and traditional use of plants are still widespread, also in Western societies. Instead of perusing ancient pharmacopeias, we developed MedCrawler, which we used to analyze blog posts for mentions of home remedies and their applications. This method i...
متن کاملThe EconoMining project at NYU: Studying the economic value of user-generated content on the internet
An important use of the internet today is in providing a platform for consumers to disseminate information about products and services they buy, and share experiences about the merchants with whom they transact. Increasingly, online markets develop into social shopping channels, and facilitate the creation of online communities and social networks. Till date, businesses, government organisation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012